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Abstract —In several application areas, such as human com¬ 
puter interaction, surveillance and defence, determining the 
intent of a tracked object enables systems to aid the user/operator 
and facilitate effective, possibly automated, decision making. 
In this paper, we propose a probabilistic inference approach 
that permits the prediction, well in advance, of the intended 
destination of a tracked object and its future trajectory. Within 
the framework introduced here, the observed partial track of the 
object is modeled as being part of a Markov bridge terminating 
at its destination, since the target path, albeit random, must 
end at the intended endpoint. This captures the underlying long 
term dependencies in the trajectory, as dictated by the object 
intent. By determining the likelihood of the partial track being 
drawn from a particular constructed bridge, the probability of 
each of a number of possible destinations is evaluated. These 
bridges can also be employed to produce refined estimates of the 
latent system state (e.g. object position, velocity, etc.), predict 
its future values (up until reaching the designated endpoint) 
and estimate the time of arrival. This is shown to lead to 
a low complexity Kalman-filter-based implementation of the 
inference routine, where any linear Gaussian motion model, 
including the destination reverting ones, can be applied. Free 
hand pointing gestures data collected in an instrumented vehicle 
and synthetic trajectories of a vessel heading towards multiple 
possible harbours are utilised to demonstrate the effectiveness of 
the proposed approach. 

Index Terms —Bayesian Inference, Kalman filtering, tracking, 
maritime surveillance. Human computer interactions. 


I. Introduction 

I N several application areas such as surveillance, defence 
and human computer interaction (HCI), the trajectory of 
a tracked object, e.g. a vessel, jet, pedestrian or pointing 
apparatus, is driven by its final destination. Thus, a priori 
knowledge of the object endpoint can not only offer vital 
information on intent, unveil potential conflict or threat and 
enable task facilitation strategies, but can also produce more 
accurate tracking routines Q-@- In this paper, we address 
the problem of predicting the intended destination of a tracked 
object from a finite set of possible endpoints and the future 
values of its hidden state (e.g. position, velocity, etc.), given 
the available noisy observations. This can be viewed as a 
means to assist or automate timely decision making, planning 
and resources allocation at a higher system level, compared 
with a conventional sensor-level tracker. The latter typically 
focuses on inferring the current value of the latent state Xt, 
with several well-established algorithms ng-iii- 
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To motivate the work presented here, consider the following 
two examples: 

1) Maritime Surveillance: analysing the route and determin¬ 
ing the destination of vessels in a given geographical area 
is necessary for maintaining maritime situational awareness 
(MSA), critical for maritime safety. This permits identification 
of potential threats, opportunities or malicious behaviour, 
allowing the protection of assets or other reactive actions |[T], 
0, 0, 0. Given the complex nature of typical maritime 
traffic as well as the vast amounts of available information 
on tracked targets, there is a growing interest in increasing 
the degree of automation in MSA systems by unveiling the 
intent of object(s) of interest from available low level tracking 
data. Thus, there is a notable demand for low-complexity 
and reliable destination and trajectory prediction techniques. 
Similar challenges can be found in aerospace surveillance, 
with aircraft in lieu of vessels. 

2) Interacting with touchscreen: touchscreens are becoming 
an integrated part of modern vehicles due to their ability to 
present large quantities of in-vehicle infotainment system data 
and offering additional design flexibility through a combined 
display-input-feedback module 1T3|,|T4). Using these displays 
entails undertaking a free hand pointing gesture and dedicating 
a considerable amount of attention that would otherwise be 
available for driving. Hence, such interactions can act as a 
distractor from the primary task of driving and have safety 
implications GD- The early inference of the intended on¬ 
screen item of the free hand pointing gesture can simplify and 
expedite the selection task, thus significantly improving the 
usability of in-vehicle touchscreens by reducing distractions 
(see HD for an overview of the intent-aware display concept). 

There is a wide range of other applications that can benefit 
from knowing the intent of a target of interest. For example, 
predicting the destination of an intruder in a perimeter, infer¬ 
ring the future position of a moving person in a given area 
to take appropriate robotic actions fTT) , and advanced driver 
assistance system in vehicles |[10, to name a few. 


A. Contributions 

The main contribution of this paper is the development 
of a simple and robust Bayesian intent inference approach 
that models the available partial trajectory of a tracked object 
as a random bridge terminating at a nominal destination. 
It can employ any linear Gaussian motion model, including 
Linear Destination Reverting (LDR) models, such as those 
detailed in Sections III-A and III-B which are intrinsically 
driven by the endpoint of the tracked object (they were used 
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in my The bridging framework introduced here capitalises 
on the premise that the path of the object, albeit random, 
must end at the intended destination. Since the endpoint is 
unknown a priori, a bridge for each possible destination is 
constructed. This encapsulates the long term dependencies in 
the object trajectory due to premeditated actions guided by 
intent. By determining the likelihood of the observed partial 
track being drawn from a particular bridge, the probability of 
each nominal endpoint, along with a refined estimate of the 
current object state Xt and its future values Xg (for s > t), 
can be evaluated. This is accomplished via Kalman-filter-based 
inference, amenable to parallelisation. 

Notably, the proposed approach in this paper does not 
impose prior knowledge of the arrival time T at the intended 
destination D. A conservatively chosen prior distribution of 
the possible times of arrival for each destination suffices. 
This allows the introduction of a technique to sequentially 
estimate the posterior distribution p{T\D) of the arrival time 
T from the available partial trajectory of the tracked object. 
Within the formulation adopted here, possible destinations 
may also be specified as (Gaussian) distributions, with a mean 
and covariance, in order to model endpoints with a non-zero 
spatial extent; point destinations can be modelled by setting 
the covariance to zero. Finally, we conduct simulations to 
illustrate the inference capability of the bridging-distribution- 
based predictors using real pointing data (HCI) and synthetic 
vessel tracks (MSA). 

B. Related Work 

One of the first techniques to incorporate predictive inform¬ 
ation on the target endpoint to improve the accuracy of the 
tracking results was proposed in Q and motivated various 
subsequent studies such as Q. It assumes prior knowledge of 
the time of arrival at destination to devise a destination-aware 
tracker. In this paper, the objective is to predict the intended 
destination of the tracked object, using motion models that 
are also inherently dependent on the endpoint. This can be 
inferred using a multiple Kalman-filter-based solution, even 
without imposing knowledge of T. It is noted that unlike 
the widely used interacting multiple models (IMM) 
m and generalised pseudo-Bayesian | |20) approaches for 
manoeuvering targets, here we construct a bridge model per 
nominal destination and no interaction or switching among 
models is applied. This is based on the premise that the tracked 
object intent is set well in advance of reaching its endpoint, 
leading to simple and low complexity algorithms. 

More recently, destination-aware trackers that facilitate in¬ 
ference of the object state Xt followed by an additional 
mechanism to determine its endpoint are presented in 0-0, 
succeeding the work in pT[ . The object trajectory is modelled 
as a discrete stochastic reciprocal or context-free grammar 
process, which can be regarded as non-causal generalisations 
of Markov processes. The state space is discretised within 
predefined regions, for example, the spatial dimensions are 
divided into finite grids. The target can accordingly pass 
through a finite number of zones. Here, in contrast, we adopt 
continuous state space models with bridging distributions 


that do not impose any discretisation/restrictions on the path 
the tracked object has to follow to reach its endpoint. This 
formulation is particularly important in applications where 
discretisation of the spatial area is burdensome, e.g. tracking 
objects in 3D as with free hand pointing gestures or surveying 
a large geographical area for MSA. The method proposed here 
provides a simple, effective solution to the intent prediction 
problem compared with those in Q, 0; it also combines the 
destination prediction and tracking operations. 

Intent inference is often treated within the context of anom¬ 
aly detection, e.g. 0,0,0-0, m- A common approach 
is to categorise each trajectory of the tracked object(s) as 
either normal or anomalous, i.e. classification techniques. This 
follows defining (or learning from recorded data) a pattern of 
life that constitutes ordinary behaviour. Deviations from this 
are considered to indicate an anomaly, assuming adequate data 
association algorithms 0, ©■ For example, a support vector 
machine (SVM) based method is introduced in [5] to classify 
deviant trajectories. A model-based technique, based on a 
tuned hidden Markov model, is utilised in 0 to characterise a 
pattern of life and predict the future position of a conforming 
target. Similarly, in 0 and related work, ‘tracklets’, which are 
sub-patterns comprising a trajectory, are defined as a means 
to capture a semantic interpretation of complex patterns such 
as anomaly or intent. In this paper, a probabilistic model- 
based formulation is proposed to tackle the intent inference 
problem rather than anomaly detection, with the posterior 
probabilities of various intents sequentially calculated using 
bridging distributions. 

The benefits of predicting the intended item on a Graphical 
User Interface (GUI) early in a pointing task are widely recog¬ 
nised in HCI, e.g. |[2^-|[26). Most existing algorithms focus on 
pointing via a mechanical device, such as a mouse, in a 2D set¬ 
up. In | [T6) , 2D-based predictors are shown to be unsuitable 
for pointing tasks in 3D. For instance, the linear-regression 
methods in | |24l and | [25l assume that the the destination is 
always located along the path followed by the pointing object, 
which is rarely true in free hand pointing gestures. Endpoint 
inference based on modelling the pointing movements as 
a linear destination reverting process is considered in m- 
Compared to fTht , the bridging-based-solution developed here 
is more general and more robust to variability in the target 
behaviour, leading to superior prediction results. 

Finally, data driven prediction/classification techniques, 
such as in (|^, (ig, j^, rely on a dynamical model 
learnt from recorded tracks/behaviour. This often entails a high 
computational cost and necessitates substantial parameters 
training from complete data sets (not always available). In this 
paper, as is common in tracking applications |[T0)-|[Tg, known 
dynamical and sensor models (e.g. due to practical physical 
constraints), albeit with unknown parameters, are presumed, 
i.e. a state-space-modelling approach. It requires minimal 
training, is amenable to parallelisation and is computationally 
efficient, yet delivers a competitive performance. 

C. Paper Outline 

The remainder of this paper is organised as follows. In 
Sectionjl^ we formulate the tackled problem and objectives. A 
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range of possible motion models are outlined in Section[nI|and 
the prior of the state value at destination, i.e. Xt, is addressed. 
In Section [TVl the bridging-based-prediction is introduced and 
pseudo-code for the proposed algorithms is provided. The 
performance of the proposed techniques is evaluated in Section 
[V| using real free hand pointing gesture data and synthetic 
vessel tracks. Finally, conclusions are drawn in Section [Vl] 

II. Problem Statement 

Let D = {d : d = 1, be the set of N nominal des¬ 

tinations, e.g. harbours where a vessel can dock or selectable 
icons displayed on a touchscreen. The time instant the tracked 
object reaches the a priori unknown intended destination D 
is denoted by T. Whilst no assumptions are made about the 
layout of D, each endpoint is modelled as an extended region, 
e.g. GUI icons or harbour, rather than a single point as in 
fl^ . Hence, the d*^ destination is defined by the (Gaussian) 
distribution d Af {ad, S^); see Section 

The objective here is to dynamically determine the probab¬ 
ility of each possible endpoint being the intended destination. 


III-C 


r{t„) = {p{D = d I : d = 1,2,..., N} 


( 1 ) 


where yi,n — {yi,y 2 , ■■■,yn} is the available partial trajectory 
or observations of the tracked object at the time instant ^ T. 
For example, as in 1161, we have y^ = [xtk Vtk i® th® 

Cartesian coordinates of the pointing finger at as captured 
by the pointing-gesture tracking device. Each observation is 
assumed to be derived from a true, but unknown, underlying 
target state Xt^ at time f„, which can include position, velocity 
and higher order kinematics. Given 'P{tn), the Maximum a 
Posteriori (MAP) estimate 

= argmax p{D = d\yi,n), (2) 

d=l,2,...,iV 


is an intuitive approach to determine the intended endpoint 
D following the arrival of the new observation ?/„. It should 
be noted that other decision criteria can be adopted, although 
this is not treated here. Therefore, the proposed probabilistic 
endpoint prediction relies on a belief-based inference, i.e. 
calculating V{tn) in Q, followed by a classifier, e.g. (|^. 

As well as establishing D at time tn, a refined estimate of 
the system latent state Xt^ and its predicted future values, 
i.e. Xf for t* > tn, and time of arrival at D are sought. 
A successful prediction at tn of the tracked object’s final 
destination and/or its future trajectory can reveal its intention, 
alerting an operator T — tn in advance of any potential conflict 
or reducing the pointing time by T — in an HCI context. 


III. System Models 

A linear and Gaussian motion model for the evolution of the 
physical state Xt of the tracked object is assumed throughout 
this paper. Whilst the system governing the target dynamics 
does not change over time, it does depend on the object 
eventually reaching its destination Z? G D. Conditioned on 
knowing the endpoint D = d, this leads to a linear time- 
invariant Gaussian system such that the relationship between 
the system state at times t and t + h can be written as 

Xt+h=F{h,d)Xt + M{h,d)+et (3) 


with Et ~ JV {0,Q{h,d)). The matrices F and Q as well as 
the vector M, which together define the state transition from 
one time to another, are functions of the time step h and the 
destination d. 

The observation is modelled as a linear function of 
the time state perturbed by additive Gaussian noise, 

Un = FiXt^ + Vn (4) 

where Vn ~ N {0,Vn)- No assumption is made about the 
observation arrival times tn and irregularly spaces, asynchron¬ 
ous observations can naturally be incorporated within the 
framework presented here. Based on Q and Q, a Kalman 
filter can be utilised to calculate both the posterior distribution 
of the latent state and the observation likelihood for the 
current set of measurements yi,n | |28) , conditioned on knowing 
d. The computationally efficient Kalman filter is particularly 
desirable since running, concurrently, multiple Kalman filters 
for c? G D, in real-time, is plausible, even in settings where 
limited computing power is available, such as on a vehicle 
touchscreen or a portable battery-powered system. 

To condition on the destination, the inference framework 
requires that the transition density of such models can be 
calculated both from one observation to the next (i.e. from 
time instant /„_i to /„) and from the current observation time 
to the arrival time at the intended endpoint (i.e. from to 
T). Continuous-time motion models are therefore a natural 
choice, where the tracked object dynamics is represented 
by a continuous-time stochastic differential equation (SDE). 
This SDE can be integrated to obtain a transition density 
over any time interval. Eor Gaussian linear time invariant 
(LTI) models, this integration is analytically tractable, giving 
transition functions of the form in This class of models 
includes many widely used ones, e.g. (near) constant velocity 
(CV) and (near) constant acceleration (CA) models, as well 
as the linear destination reverting models given below. 

The early work in p6| employs LDR models in a forward 
sense, without using the destination state as conditioning 
information. In this paper, by contrast, a prior probability 
distribution on the object state at its destination X-r is used dir¬ 
ectly as conditioning information in the inference procedure. 
This is achieved by using bridging distributions to introduce 
the longer term dependencies in the target trajectory (see 
Section |III-C[ ). A similar idea is explored in the preliminary 
study in using the CV model, in which the endpoint 
information does not feature in the motion model. That work 
also specifies each destination as a single point (rather than 
a prior distribution) and applies a two step modified Kalman 
filter to infer D, which is more computationally demanding 
and obscure compared with the solution presented here. 

In the remainder of this section, we describe the LDR 
models of for completion, other related motion processes 
and the prior of Xt- The predictive position and velocity 
distributions of selected motion models, with and without the 
use of bridging (i.e. conditioning on Xt) are depicted in 
Eigure[T]in the one dimensional case. The figure demonstrates 
the substantial effect of introducing the bridging assumptions 
on the prediction results of various motion models. It clearly 
shows how these distributions more accurately model the 
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Position, no bridging 


Position, w/ bridging Velocity, w/ bridging 



Figure 1; Predictive distribution at t = 0 of position without bridging 
(first column), predictive position with bridging (second column), 
and predictive velocity with bridging (third column) for the BM, 
MRD, CV and ERV models in ID. The at-axis shows time t and y- 
axis depicts position/velocity. The destination (black star), is located 
at position 10 at time T = 10. Grey shading shows one standard 
deviation, white line depicts the mean (except for BM and MRD 
velocity, for which the black line shows the implied mean of velocity 
only). Parameters for the models are a = 1 (all models), A = 0.3 
(MRD), 77 = 0.1, p = 0.5 (ERV). 


predicted state at the endpoint. It is worth noting that any 
continuous-time LTI model could be used directly within the 
proposed inference framework, as long as the system can be 
expressed by equations ([^ and Q. 

A. Mean Reverting Diffusion (MRD) 

The position of the tracked object for the Mean Reverting 
Diffusion (MRD) model follows an Ornstein-Uhlenbeck pro¬ 
cess with its mean being the destination. The intuition 
behind MRD is that a target heading towards a particular 
endpoint will revert towards it. The reversion strength is 
stronger the further the target is from its destination. This 
produces the predictive position and velocity dynamics of the 
form displayed in the second row of Figure 

The state Xt of the mean reverting diffusion model consists 
of only the target position in each of the s spatial dimensions. 
For endpoint d (located at position pd), the SDE is given by 

dXt = A{pd-Xt)dt + adwt, (5) 

where A is a diagonal matrix with diagonal elements 
that set the reversion coefficient in each dimension. The diag¬ 
onal matrix a specifies the standard deviation of the dynamic 
noise and Wt is a standard (unit variance) s-dimensional 


Weiner process. The diagonal structure of a implies that the 
noise is independent in each spatial dimension, which is a 
common assumption in tracking m- However, this can be 
relaxed as shown in equation Q below. 

As in by integrating Q from t to t + h, we obtain a 
state transition function in the form of equation ([^ with 

FMRD{h,d) = e~^^, 

A/mrd(/i, rf) = {Is — e~^^)Pd, (6) 

QuRoiffd) = ^[ls- 

where Ig is the s x s identity matrix. For non-diagonal cr, the 
element of Qmrd is given by 


QMRD.ij 


-^33 


2^ _ g—(A-ii-l-Ajj)?i 


(7) 


where z' is the transpose of the vector/matrix z. A special 
case of the MRD model occurs when A = Og (with Og being 
the s X s matrix of zeros), in which case the dynamics of the 
target position follow a Brownian motion (BM). In this case, 
the F, M and Q matrices in ([^ become 


Arm (^5 dl) — hlgy 

MBM{h,d) = Osxi, ( 8 ) 

< 3 bm(/i, d) = ha"^, 


where Ogxi is an s x 1 column vector of zeros. 

If measurements are direct noisy observations of the 
tracked object position, the observation matrix G in equation 
0 is simply the identity matrix, G = ff- 


B. Equilibrium Reverting Velocity (ERV) 

In the ERV model, introduced in for tracking and 
destination inference for pointing tasks on in-car displays, 
the system state Xt at time f is a 2 s x 1 vector, arranged 
as [xi,..., Xg, ± 1 ,..., Xg]', where Xi is the position in spatial 
dimension i, and it is the velocity in that dimension. Under 
this model, the SDE governing the evolution of the tracked 
object state is 

dXt = A{pd-Xt)dt + adwt, (9) 

where the mean pd = [Pd’^sxiY contains the position pd of 
destination d; Wt is a standard s dimensional Wiener process 
and i? = [Og, cr] is a 2s X s matrix that controls the noise which 
affects the velocity components of the process (modelling 
random forces acting on the target). In this case, a can be 
given by the s x s Cholesky decomposition of the velocity 
noise covariance E, such that E = aa'. As with the MRD 
model, a common choice is a diagonal cr, i.e. independent 
noise in each spatial dimension. The matrix A is given by 


where p is an s x s diagonal matrix of the drag coefficients 
(can be assumed to be the same across all dimensions) and p 
is a s X s diagonal matrix of the mean reversion strengths in 
each spatial dimension. 
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A physical interpretation of the ERV model is that the 
destination d exerts an attractive force on the tracked object, of 
strength proportional to the distance separating them. This can 
be viewed as a linear spring of zero natural length connecting 
the target to its endpoint. A drag term proportional to the 
velocity of the target is also included, allowing the velocity 
profile of the tracked object (e.g. pointing-finger-tip in a free 
hand pointing gesture) to be correctly modelled; see Figure [T] 
(fourth row) and fH- 

Integrating the SDE in (j^ from t to t + h allows the system 
evolution to be expressed in the form in equation ([^ with 

FERv{h, d) = e~^^, 

MERy{h,d) = {l2s — ( 10 ) 

pt-\-h 

QERy{h,d)= 

The covariance matrix Q^Rsih, d) can be calculated by Matrix 
Fraction Decomposition pT) where 

QERyih.d) = JK~^, 


J 

K 


= exp 


'-A 

aa' 

h) 

02 S 

_02s 

A' 

h) 



A special case of the ERV model occurs when 77 and p are 
both zero. In this scenario, the model reduces to the widely 
used (near) constant velocity model with 


Fcy{h, d) 



Mcy{h, d) = O2SXI, 
Qcy{h,d)= 


aa 




2 

aa'h 


( 11 ) 


If observations are direct noisy measurements of the target 
position, the observation matrix G in equation 0 is given by 
G=[ls 0«]. 


object velocity at the destination. If this is unknown, a large 
prior variance can be used to model this uncertainty. 


IV. Intent Inference Using Bridging Distributions 

For motion models of the form in equation ([^, and con¬ 
ditioning on a given destination d S D as well as arrival 
time T, the posterior of the system state can be expressed 
by p{Xt^ I yi-,n^T,D = d) and the observation likelihood 
is p{yi-.n \ T, D = d) after n measurements. The graphical 
structure of this system is depicted in Figure The intended 
destination D influences the state at all times via the reversion 
built into the LDR motion models (for non-destination revert¬ 
ing models, e.g. CV and BM, the endpoint only affects the 
final state). Nonetheless, the inclusion of the prior on in 
equation ( [T 2 I 1 changes the system dynamics compared to those 
of the MRD and ERV models in m- Under the assumption 
that the destination is known, the posterior distribution of the 
state changes from a random walk to a bridging distribution. 
This is clearly visible in Figure 2, especially in terms of 
producing consistent and meaningful predictions. 


A. Filtering and Likelihood Calculation 

An elegant way to filter for Xt with a destination prior 
is to extend the latent system state to incorporate Xt, thus, 
it becomes Zt — [X'f. X'j,]'. Filtering is then carried out 
for p{Zt^ I yi-n,F) = d,T), permitting the calculation of 
the observation likelihood pijji-.n \ D = d,T). The key 
to developing this filter is to calculate the transition density 
p{Zt+h \ Zt,D = d, T), given by 

p{Zt+h \Zt,D = d, T) = p{Xt I Xt+h, Xt, D = d,T) 

xp{Xt+n\Xt,XT,D = d,T) 
= p{Xt+h \Xt,XT,D = d,T). 

(13) 


C. State Value Prior at Destination 

Knowing the intended destination D of a tracked object 
gives information about the system state at some future time 
T. This can be modelled by a prior probability distribution for 
Xt corresponding to the geometry of the destination, since 
most endpoints are extended regions (e.g. GUI buttons or 
harbours), rather than single points. In order to maintain the 
linear Gaussian structure of the system, the following Gaussian 
prior on the object state upon arrival at destination at time T: 

p{XT\D = d)=N{XT-,ad,S:d), ( 12 ) 

is assumed. This effectively models the destination as el¬ 
lipsoidal, since the iso-probability surfaces of the Gaussian 
distribution are elliptical. The mean vector Od specifies the 
location/centre of the destination, for example, ad = Pd 
for the MRD model and ad = Pd from equation (|^ for 
the ERV model. Whereas, Ed is a covariance matrix of the 
appropriate dimension, which sets the extent and orientation 
of the endpoint. In the case of the ERV model, defining the 
destination also entails specifying a distribution of the tracked 


This follows because p{Xt \ Xt+n, Xt, D = d,T) — 1 for 
any value of Xt, due to the fact that the state Xt is included in 



Figure 2; The graphical structure of the system after n observations. 
The destination D plays a similar role to the prior distribution of 
Xtj in addition to affecting the state transition function. Heavy lines 
indicate a deterministic relationship (early state transition matrices 
are not shown in the figure). 
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the conditioning information via Zt. The distribution p{Xt+h \ 
Xt,XT,D = d,T) is given by 

p{Xt+h I Xt, Xt, D = d,T)<x p{Xt I Xt+h, D = d,T) 

xp{Xt+h \XuD = d,T) 
= M {Xt; F,Xt+h + Q,) M {Xt+h] F^Xt + Mh, Qh) 

(14) 


where the F, M and Q matrices are taken from the motion 
models in Section ||n] with 

F^ = F{T-t-h,D) Fh = F{h,D), 

M^ = M{T -t-h,D) Mh = M{h,D), (15) 

Q^ = Q(T-t-h,D) Qh=QiKD). 

This comes from the system structure in Figure and the fact 
that for the continuous-time integrable models used here, the 
state transition density can be calculated over any time period. 

The following Gaussian identity simplifies the state trans¬ 
ition density in equation ( [T4l i: 

JV {x; pi,'Zi)JV {p 2 ', Lx,^ 2 ) = zJV {x; (16) 


where L is a matrix of appropriate size, x is a normalizing 
constant that does not depend on x, and 

/i* = S -f L'Fi2 ^ 1 ^ 2 ) • 

This leads to 


p{Xt+h I Xt, Xr, T,D = d)=N [Xt+h] ct,Ct), 

where 

ct = Ct [Q^\FhXt + Mh) + F'^Q-\Xt - M,)] , (17) 
= FlfZi -f rrit, 

with, for an r-dimensional state vector Xt, Fit a r x2r matrix 
and m* a r X 1 vector of the form 

Ht = [CtQl^Fn, CtF'Q-i], 
mt = Ct{Q^^Mh-F'^Q-^M,). 


This allows the state transition with respect to Zt to be written 
as a linear Gaussian transition of the form 


Zt+h — RtZt + rht + 7t 

7 t-AA( 0 ,C/t). 


where 

Rt 


Ht 

Ft ’ 


mt 



Ct 

0 ^ 



(18) 


(19) 


and Ft = [O,. C] ■ With respect to this extended system, the 
fc-dimensional observation vector at each observation time is 
given by 


Vn — CZt^ 


+ 


( 20 ) 


with G = [G, Ofcxr] and where G and Vn are as those in 
equation Q. 


Algorithm 1 Conditioned Prediction Error Decomposition and 
State Inference - Kalman Filter (single iteration) 

Notation: Z„, E„} = KF(i/„, E„_i, i?*, Ut, G) 

Input: Observation y„; previous posterior state mean 
estimatel Z^-i', previous posterior state covariancel E„_i; 
state transition matrix Rt and covariance Ut from equation 
observation matrix G from equation ( [20l i 
Predicfl: 

Zn\n—1 — RtZn—1 F Fit 

'Z‘n\n-1 = RtZin-lR't + Ut 

PED Calculation: 

ln= ■N' (pn] 

Correct: 

K = F'nln-lG' {GYin\n-lG' + Vn) 

Zn — Zn\n—1 “f HiVn GZn\n—l 

Tin = {Ir — KG)Tn\n-l 

Output: PED in = PiUn I Vi-.n-iiF) = d,T)-, posterior 
mean of state at time f„, Z^, posterior state covariance 
Tn-l- 

iFor first observation yi, skip predict step and use prior 
mean and covariance for from equation (|2T| in place of 
Zn\n-i Sn|n-i In Hkelihood calculation and correction 
steps. 


Equations ( flSj ) and ( [20l i form a standard linear Gaussian 
system, albeit with a degenerate state transition covariance 
matrix. Thus, a standard Kalman filter can be applied to 
calculate the conditioned posterior filtering distribution of 
the state and the conditioned prediction error decomposition 
(PED), p{yn I yi-.n-i,D = d,T). This latter is shown in 
Sections |IV-C| to |IV-E| to be key to destination inference, 
because it allows the destination and end-time conditioned 
observation likelihood of the partially observed track to be 
calculated recursively as p{yi:n \ D = d,T) = p(jjn \ 
yi,n-i,D = d,T)p{yi,n-i \ D = d,T). The algorithm for 
a single iteration of the Kalman filter is given in Algorithm 
It requires the prior Zt^ = [X't^ , X'j,]' at the first observation 
time. The prior on Xt^ is the standard prior on the initial 
state. The prior on Xt is derived from the destination as 


described in Section III-C and can be neatly incorporated into 


the Kalman filter. Both these priors must be Gaussian, and in 
most modelling scenarios they will be independent of each 
other, giving a prior for Zt^ of the form 


p{Zt, \T,D = d)=M 


Xt, 

Xj' 


Si Ot- 

Or Td 


( 21 ) 


where pi and Si specify the initial prior on Xt,, i.e. p(Xt, | 
D — d) = Af {Xt^; pi,Ti)-, and S^ come from the 
destination prior in equation ([T^. 


B. Unknown Arrival Times 

So far, it has been assumed that the arrival time at the 
destination T is known a priori. Often, this is not a realistic 
assumption and we are interested in the probability of the 
tracked object arriving at a destination at any time within 
some time interval. In this case, the unknown arrival time T 






















SUBMITTED TO IEEE TRANSACTIONS ON CYBERNETICS, VOL. XX, NO. XX, XXXX XXXX 


7 


is treated as a random variable, which must be integrated over 
in order to infer the destination of the tracked object. 

The observation likelihood with unknown arrival time is 
given by p{yi:n \ D). This can be calculated by integrating the 
arrival-time-conditioned likelihood calculated by the Kalman 
filter in the preceding section (see Algorithm 0 over all 
possible arrival times according to 


\D)= [ I T, D)p[T I D)dT, (22) 

JTeT 


where p{T \ D) is the a prior distribution of arrival times for 
destination D and T is the time interval of possible arrival 
times T (D = dis here replaced by D for notational brevity). 
For example, arrivals might be expected uniformly within 
some time period [ta.h], giving p{T \ D) =U{ta,tb). 

In most cases, p{yi-n \ T, D) after n observations is a 
nontrivial function of T resulting in intractable integrals. A 
numerical approximation to the integral in equation ( |22l l can 
be obtained via numerical quadramre, which is viable since the 
arrival time is a one-dimensional quantity. The approximation 
requires multiple evaluations of the arrival-time-conditioned- 
observation-likelihood (for various arrival times T) for each 
of the N nominal endpoints. Therefore, in time-sensitive 
applications, it is likely that only a few quadrature points q 
can be used. 

Whilst adaptive quadrature schemes might seem appealing 
for their efficiency and accuracy, they are not easily applied 
to this problem. For a given arrival time, only a single 
iteration of the Kalman filter as in Algorithm need be 
run (per destination d G D), following the arrival of a new 
observation. Thus, if a fixed quadrature is utilised with q 
quadrature points, Nq Kalman filter iterations of the form in 
Algorithm [T] are required per observation arrival. For adaptive 
quadrature, it will, in general, be necessary to calculate the 
observation likelihood for a different set of arrival times at 
each step. This, however, requires the Kalman filter be re-run 
from scratch for all n available observations, imposing nNq 
iterations. Hence, simple fixed grid quadrature schemes are 
employed here. A Kalman filter iteration as in Algorithm [T] is 
not computationally intensive and each of the Nq iterations, 
required per observation, can be performed in parallel. For 
arrival times before the current observation time Ti < tn, the 
arrival-time-conditioned observation likelihood is taken to be 
zero, such that p{yi-n \T = Ti < Z3) = 0. 

Here, we apply a Simpson’s rule quadramre scheme, with an 
odd number q of evenly spaced quadramre points, Ti,...,Tq. 
This approximates the integral in equation (|22|) by 


piyi-.n I D) 


Tq-Tl 

3(g-l) 


P{yi:n I T = Ti,D) 
(9-1)/2 


+ P{yi:n\T = Tq,D) T ^ ^ p{yi,n \ T = T2i, D) 

■ (23) 


(g-l)/2-l 

+ 2 ^ p{yi,n\T = T 2 t+i,D) 


i=l 


Other numerical integration schemes such as Gaussian quad¬ 
rature can be employed. It may also be desirable to use unequal 


intervals between quadrature points, e.g. if the distribution of 
arrival times is heavily peaked in particular regions. 


C. Destination Inference 


The posterior distribution of the nominal destinations, i.e. 
p{D = d I yi-.n), d S D in equation Q, at the time instant 
can be expressed via Bayes’ theorem by 

p{D = d I yi,n) cx p{yi:n I D = d)p{D = d). (24) 

The discrete probability distribution p{D = d) defines a 
prior over all possible destinations; it is independent of the 
current track and can be obtained from contextual information, 
historical data, etc. Alternatively, a non-informative prior can 
be used where p{D = d) = 1/N for all d € D. 

Algorithm shows how the posterior distribution over 
destinations p{D = d \ yi-n) in Q can be inferred sequentially 
given a series of observations yi.„. It begins by initializing 
the running likelihood estimate for each d G D and the 
current posterior state mean as well as covariance 

for each destination d and quadrature point i (corresponding 
to arrival time Ti) to their priors. After each observation, 
the Kalman filter iteration in Algorithm [T] is utilised to cal¬ 
culate the one step arrival-time-conditioned-observation FED 
= pijjn I yi-.n-i,T = Ti,D = d) for each destination 
d G D and quadrature point i. This is then used to calculate 
the overall arrival-time-conditioned observation likelihood 

=piyi:u\T = T,,D = d), 

= PiVn I yi-.n-l,T = Ti,D = d)p{yi:n-l \ T = T^, D = d), 
= X (25) 

The Kalman iteration also determines the corresponding up¬ 
dated posterior state mean and covariance which 

are necessary for the next steps. 

After mnning a Kalman iteration for all quadrature points, 
the quadrature function (and arrival time prior) is used on 
the likelihood calculated at each of these points for a 

destination d in order to approximate the integral in ( p2| . This 
gives the observation likelihood p{yi:n \ D = d) for each 
destination d G D, 

piyi..„ \D = d)^ quad(4‘'-i),4"-2)^ (26) 


For a finite set D, the probability of any given destination as in 
0 and plcmn be determined by evaluating the expression in 
equation^24ll for each d G D, followed by the normalisation 


p{D = d I yi:n) 


p{yi:n I D = d)p{D = d) 

I D = j)p{D = j) ’ 


which ensures that the total probability over all possible 
destinations sums to 1. In Algorithm this is approximated 
(due to numerical quadrature) by Ud ~ p{D = d \ yi-.n)- 
For a hxed set of quadrature points, this estimated posterior 
can be updated sequentially after each observation, making it 
tractable for a large numbers of possible destinations {N) and 
quadrature points {q). 
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Algorithm 2 Destination Inference 


Set Lq = 1 and set Zq 


{d,i) 
0 

, i = 


to priors 


Input: Observations 

Initialize: Set 

from equation ( | 2 T] l for all d G 
for observations n = 1,iV do 
for destination d G D do 

for quadrature point i G 1 , • ■ ■, <7 do 

Calculate in equation ( [T 8 | ) for obser¬ 

vation time tn, destination d and arrival time Ti 
Run Kalman filter iteration; 


{e 




}=, 


is the FED for a known arrival time, i.e. 
= piVn \ yi:n-l, D = d,T = T,)} 

Update likelihood; = L^^-l ^ 

end for 

Calculate likelihood approximation; 


P 

r 


(d) 


= quad(Lr^\Li‘'’"\...,LL''’«)) 


' Pivi-.n I D = d) \ quad is quadrature function} 


end for 

for destination d G D do 

_ p{D=d)P<^) 

T.a^nPiD=d)Llf> 

end for 

Destination posterior after n‘'’ observation: 

p{D = d I yi-.n) « Ud 

end for 


D. Arrival Time Inference 


In addition to inferring the intended destination D of the 
tracked object, it is also possible to infer a posterior distri¬ 
bution of the time at which the target is expected to reach 
its intended (unknown) endpoint. For a specihc destination 
£) = d, this is given by 


p{T\D = d, cx p{y^..t \T,D = d)p{T \D = d). (28) 


Since the quadrature procedure used in Section [TV-A| requires 
calculation of the arrival-time-conditioned-likelihood p{yi-t \ 
T = Ti,D = d) for a number of quadrature points Ti, a 
discrete approximation of the overall posterior can be obtained 
almost without additional calculations via 

p{T \ D = d,yi,t) (29) 

i=l 


where d^Ti} is a Dirac delta located at Ti. This is a weighted 
distribution over the quadrature points with 


Wi = 


p{yi-.t \ T = T,,D = d)p{T, \D = d) 


ELi PiVi-t I T = Ti, D = d)p{Ti \ D = d)' 

These weights are normalized evaluations of the expression 
in equation ( |28] l, calculated at each Ti. Normalization ensures 
that the approximate posterior distribution in (29 1 is a valid 
probability distribution that integrates to 1 . 


The posterior distribution of the arrival time at any destina¬ 
tion can be calculated without signihcant further calculations 


by integrating over all endpoints (since D is a discrete set) as 

p{T I yi-.t) = p{T, D = d\ yi:t) 

deD 

cx J2piyi:t \T,D = d)p{T I D = d)p{D = d). (30) 

deD 

Since the conditioned likelihood p{yi:t \ T,D = d) is 
determined for each d G D and quadrature point T = Ti, 
i = 1,2, ...,q, the distribution in ( [30| can be approximated by 

9 

P{T I yi.t) « 

with the weights defined by 

_ EdgDF(di:t \Ti,D = d)p(T, I D = d)p{D = d) 
ELi EdeDP(di:t \T^,P = d)p{T, I D = d)p{D = d)' 

This relies on the assumption that the set of arrival time 
quadrature points are the same for each destination d G D. 


E. State Inference and Trajectory Prediction 


Estimating the current state of the tracked object (e.g. 
position) and predicting its future state can also be readily 
preformed within the framework presented. The posterior 
estimate of the target state p{Xt^ \ yi-.t) is given by integrating 
over all possible destinations and arrival times, i.e. 


p{Xt„ I yi:t) 


\yi..t,T,D = d) 

- dGD 


X p(T I D = d)p[D = d) 


dT. 


(31) 


The estimated state at the time instant conditioned on the 
arrival time Ti and endpoint d is p{Xt^ \ yi,t, T = Ti, D = d). 

It has a Gaussian distribution with mean and variance given by 
and PtT,^’^^P[, respectively, where, Pt = [ir 0^.1, 
i.e. the components of zif'^ and in Algorithm 

corresponding to Xt„ rather than Xxi . 

The distribution in ( |3T] l can be approximated from the cal¬ 
culated arrival time and destination conditioned state estimates 
by the Gaussian mixture 

I yi,t) « E E 

2—1 dG'B 

where weights in the above summation are given by 

- pivif I PjyP = d)p{Ti \ D = d)p{D = d) 

ELi £dGDF(di:t \ T^l D = d)p{T, I D = d)p{D = d)' 

(32) 

The future state of the tracked object can be estimated 
using its dynamics model, applied to each arrival time and 
destination. This is identical to the ‘predict’ step of the Kalman 
filter in Algorithm [T] for the future time instant of interest 
t ^ tji. For a given arrival time T = Ti and destination 
D = d, the prediction of the object future state is Gaussian 
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with mean and covariance specified by, respectively, 




+ m 


(d,i) 

f 


\n 




)' + 


Each of and are calculated in the same 

way as Rt, mt and Ut in equation (|T^ for a time step h, 
corresponding to the required prediction time, i.e. h = t* — tn- 
The predicted distribution of the target state is a Gaussian 
mixture with the same component weights Ui^d as for state 
estimate at the current time instant in ([3^ such that 


/ 

I yi:t) ~ X! X! i 


{d,i) 

t* \n 


^ t*\n t 


i—1 


(33) 


The inference algorithms for the posteriors of the destination, 
arrival time, and current and future state of the tracked 
object can be readily parallelised, with calculations for each 
quadrature point and/or nominal destination able to be run 
largely independently on an independent processor. Only the 
weight normalization step as in e.g. ( |3^ requires results from 
all calculations to be available, fitting naturally into e.g. a map- 
reduce programming paradigm for parallel implementation. 


V. Results 

In this section, we evaluate the performance of the proposed 
bridging-distributions-based intent inference approach in two 
application areas, namely and maritime surveillanc^ 

Maximising the likelihood function n/=i Pivi-r \ D = d,V,) 
for a sample of J typical trajectories (constituting the training 
set) is the criterion adopted below to set the motion model 
parameters fl . For example for the MRD and ERV models, 
we have = {A, A, a}. The learnt values are then applied to 
out-of-sample tracks. This parameter estimation procedure is 
suitable for an operational real-time system where parameter 
training is an off-line process based on historical data. 


A. Intent-aware Interactive Displays 

In the results presented here, 50 free hand pointing gestures 
pertaining to four participants interacting with an in-vehicle 
touchscreen are used (5 tracks are utilised for training). This 
data is collected in a system mounted to the dashboard of 
an instrumented car (identical to the prototype used in 1161). 
It consists of an ll” Windows tablet and a gesture-tracker, 
namely the Leap Motion (EM) Controller p^ . The EM 
produces, in real-time, the 3D cartesian coordinates of the 
pointing finger/hand, i.e. = [xt^ yt„ at an average 

frame rate of « 30Hz. An experimental GUI of a circular 
layout is displayed on the tablet screen; it has 21 selectable 
circular icons that are « 2 cm apart. Participants are asked 
to select a highlighted on-screen GUI item, giving a known 
ground truth intention £)+. Nevertheless, in all experiments. 


'Please refer to the attached video demonstrating an intent-aware display 
operating in real-time on a sample of typical in-car free hand pointing gestures. 

^Please refer to the attached videos demonstrating the destination and track 
prediction for a vessel approaching a coast with multiple possible ports. 



Pointing Task Durations (s) 


Figure 3: Distribution of the pointing times, p{T\D), from over 4000 
in-vehicle free hand pointing gestures to select on-screen icons. 


the predictor is unaware of the track end time T and the 
intended destination, when making decisions. To demonstrate 
the possible range of times of arrival at destination. Figure 
1^ depicts the distribution of T for 4,000 recorded in-vehicle 
free hand pointing gestures aimed at selecting on-screen icons. 
This empirical p{T \ D) is used as the arrival time prior for all 
destinations within the bridging-distribution-based predictors. 

The inference performance is evaluated in terms of the 
ability of the predictor to successfully establish the intended 
icon D via the MAP estimator in 0, i.e. how early in the 
pointing gesture the predictor assigns the highest probability 
to D. The prediction success is defined by S'(/„) = 1 if 
D{tn) = and 5(/„) = 0 otherwise, for observations at 
times tn G {ti,t 2 , This is depicted in Figure versus 

the percentage of completed pointing gesture (in time), i.e. 
tp = 100 X tn/T, and averaged over all considered pointing 
tasks. Figure shows the proportion of the total pointing 
gesture (in time) for which the predictor correctly establishes 
the intended destination. 

In Figures[^and[^ we assess the linear destination reverting, 
BM and CV models with the bridging prior notated here as 
MRD-BD, ERV-BD, BM-BD and CV-BD. A mean reverting 
diffusion model without bridging (MRD) is also shown to 
illustrate the gain attained by incorporating the prior on Xp- 
Additionally, the benchmark Nearest Neighbour (NN) and 
Bearing Angle (BA) methods are examined (see p6) for 
more details). In the former, the destination closest to the 
pointing finger-tip position is assigned the highest probability 
and vice versa; i.e. p{yn\D = d) = M {yn;Pd,cr%]^) where 
is covariance of the multivariate normal distribution. 
In BA, p{yn\yn-i,D = d) = A/" (6I„ |0, cr|^) where 6i„ = 
Z (?/„ — yn-i,d) is the angle to destination D and is a 
design parameter. It assumes that the cumulative angle to the 
intended destination is minimal. 

Figure shows that the introduced bridging-distributions- 
based inference schemes CV-BD and ERV-BD, achieve the 
earliest successful intent predictions. This is particularly vis¬ 
ible in the first 70% of the pointing gesture where notable 
reductions in the pointing time can be obtained and pointing 
facilitation regimes can be most effective (e.g. expanding 
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Figure 4; Mean percentage of successful destination inference as a 
function of pointing time. 
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Figure 5: Gesture portion (in time) with successful prediction (error 
bars are one standard deviation). 


icon(s) size, colour, etc.). Destination prediction towards the 
end of the pointing gesture, e.g. in the last third of the pointing 
time, has limited benefit, since by that stage the user has 
already dedicated the necessary visual, cognitive and manual 
efforts to execute the task. In general, the performance of all 
evaluated predictors improves as the pointing hand/finger is 
closer to the display. This is particulary visible for the nearest 
neighbour model, which exhibits very poor performance early 
in the pointing task and gradually matches other techniques 
as the observed track length increases as the pointing finger 
becomes close to the endpoint. An exception is the BA model, 
where the reliability of the heading angle as a measure of intent 
declines as the pointing finger approaches the destination. 

The gains from combining the MRD motion model with the 
bridging technique (MRD-BD) are clearly visible in Figure 
compared to the MRD without bridging. This can be attributed 
to the ability of bridging models to reduce the sensitivity 
of linear destination reverting models to variability in the 
processed trajectories, which reduces the system’s sensitivity 
to parameter estimates and thus reduces parameter training 
requirements. MRD performance, e.g. compared to the NN, 
has notably deteriorated compared to that reported in HD 
since less parameter training is performed here; only 5 out of 


the tested 50 tracks are used for training. Similar observations 
are made for the ERV which has even more parameters than 
MRD; the quality of its predictions without bridging (not 
shown here) is very poor. 

Figure demonstrates that the proposed bridging- 
distribution-based inference delivers the highest overall correct 
destination predictions across the pointing trajectories. The 
highest aggregate successes are achieved by the constant 
velocity and equilibrium reverting velocity models with rel¬ 
atively tight error bounds. This is due to the importance of 
the velocity component in the pointing task, which is only 
captured by these two models. MRD without bridging has the 
largest variance, highlighting its lack of robustness without the 
bridging element. NN and BA performances are similar over 
the considered data set. 

It is important to note that small improvements in pointing 
task efficiency (effort reduction), even reducing pointing times 
by few milliseconds, will have substantial aggregate benefits 
on overall user experience since interactions with displays are 
very prevalent in typical scenarios, e.g. using a touchscreen in 
a modern vehicle environment to control the car infotainment 
system |fT3|, p4|. 


B. Maritime Situational Awareness 

Figure shows the results of the proposed destination 
inference algorithm, via the MAP estimate in (|^, for a two- 
dimensional vessel tracking problem. The aim is to predict 
the endpoint of each vessel, from a set of = 6 possible 
harbours, based on noisy observations t/i:„ of its trajectory up 
to the present time tn. The trajectory data is generated from 
a bridged constant velocity model, such that tracks begin at a 
random point in the middle of the bay (around 20km from the 
shore). It is conditioned on arriving at a chosen destination 
port (all equally likely) at a uniformly distributed random 
arrival time between 50 and 250 minutes later. Velocity at 
arrival is assumed to have a Gaussian distribution with a 
zero mean and standard deviation 10m min“^ (i.e. relatively 
slow). The dynamic noise parameter is cr = 20m min“^/^ 
in both dimensions. Observations are direct measurements of 
the current vessel position, corrupted by Gaussian noise with 
standard deviation of Im. 

Destination inference in Figure is performed using a 
bridged constant velocity model and the arrival time prior is 
uniform over the interval [50, 250] minutes, i.e. p{T \ D) = 
f^(50, 250). Quadrature was based on g = 15 evenly spaced 
arrival times over this time interval, employing Simpson’s rule 
integration as in ( |2^ . 

For most of the tracks depicted in Figure the correct 
intended destination is inferred early in the observed trajectory. 
When the algorithm fails to achieve such early predictions, 
e.g. vessels heading to harbours 2 and 4, visual inspection of 
these tracks shows that the vessel in question makes a notable 
sharp manoeuvre at some point in its trajectory. Prior to these 
sharp changes in direction of travel, the vessel appears to be 
heading towards a different endpoint. Nevertheless, after the 
manoeuvre is completed, the correct destination is quickly 
inferred. This figure clearly highlights the potential of the 
proposed bridging-distribution-based destination inference. 
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Figure 6: Destination inference for ten ship trajectories, heading to 
one of six numhered destinations. Thick green lines show the portion 
of each trajectory for which the true intended destination is correctly 
inferred via MAP estimate in thin red lines show portions of the 
trajectory for which this was not the case. 



Figure 7: Proportion of track for which the true destination was 
inferred for different numbers of quadrature points, using Simpson’s 
rule as in equation ( |23^ , averaged over 100 tracks (error bars show 
± one standard deviation). End times are uniformly distributed in 
the interval [50, 250], within which the quadrature points are evenly 
spaced. For one quadrature point T = 250 is assumed for all tracks. 


Figure examines the effect of the number of quadrature 
points q on the endpoint inference outcome for the above 
vessel tracking scenario. This figure shows the average and 
standard deviation of the aggregate prediction successes for 
100 randomly generated vessel trajectories using Simpson’s 
rule quadrature as in equation ( [2^ with varying numbers of 
quadrature points q. Figure illustrates that increasing the 
number of quadrature points (up to q = 9) improves the 
inference performance, after which the success rate levels 
off. The result for one quadrature point is that for assuming 
T = 250 minutes for all tracks. The results in this figure 



Figure 8: Arrival time estimate (across all possible destinations) for 
a track similar to those shown in Figure The red line shows the 
true arrival time and shading shows the posterior density at each time. 


demonstrate that assuming a distribution of arrival times is 
beneficial for destination inference compared to guessing a 
single arrival time. Most importantly, it shows that only a few 
quadrature points (e.g. 9 in this case) are sufficient to leverage 
this benefit. 

Figure shows the arrival time estimations for a track 
similar to those considered in Figure]^ applying the technique 
described in Section |IV-D| with g = 31 quadrature points. 
Initially, the arrival time is uncertain as shown by the diffuse 
shading. However, as more trajectory data becomes available, 
the posterior steadily becomes more concentrated in the region 
of the true arrival time (red line). 


Finally, the predicted target position at a number of future 
time instants t* > tn are displayed in Figure using the 
prediction method in Section IV-E At the time shown 
40 observations have been made. The tracked object’s true 
position up to each of the assessed future times is shown in red. 
Initially, the inference is unimodal, dominated by the object’s 
current motion. As predictions are made further into the 
future, the possible destinations of the target become visibly 
influential and the predicted position becomes multimodal (e.g. 
see the last row in Figure]^. Each of these modes corresponds 
to a destination d G D and the prediction is dominated by 
the endpoint(s) deemed to be more probable by the inference 
algorithm. Eor example, the available trajectory (blue line) 
leads to more weight being assigned to endpoints 4, 5 and 6 
compared to the other possible destinations (noting that d = 6 
is the true intended destination). 


The effect of the unknown arrival time T (estimated using 
q = 25 quadrature points) is reflected in Eigure|^by the shape 
of the predicted densities, which resemble a “Anger” shape 
pointing to each destination (particularly in the fourth panel at 
t* = 97). These ‘Angers’ result from an increased uncertainty 
about where the target is in its approach to each destination 
due to the unknown arrival time. As more observations are 
made, this effect diminishes due to the increased confidence 
in the arrival time, as per Eigure 
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t =40 


= 59 



Figure 9: Predicted target position. Shading shows the posterior 
distribution of the tracked object position at a range of future time 
instants t*. The available trajectory X\aq, i,e. at = 40, is depicted 
by the solid blue track. Top left panel shows the current position 
posterior at t„ = 40 and subsequent panels exhibit the predicted 
position posterior at times t* > 40, up to t* ~ T. Red line indicates 
the true trajectory values at t* (red dot is the true target position at 
t*). The arrival time and destination are unknown. 


Defining a normal trajectory template (pattern of life) for 
vessels moving towards potential docking points (e.g. over 
relatively short distances as in Section [V-B| i or protected assets 
can be challenging. This is due to the fact that the tracked 
object approach to the intended endpoint can significantly 
vary and might not necessarily conform to a known pattern 
(especially if the intent is malicious). Besides, building a 
complete training data set might not be possible for such 
scenarios. The proposed inference algorithms in this paper, 
which do not treat the intent inference problem as an anomaly 
detection problem, are able to handle such a setting effectively 
with consistent predictions (destination, position and time of 
arrival) through the use of bridging distributions. 

VI. Conclusion 

This paper sets out a probabilistic framework for a simple, 
low complexity intent inference that demands minimal training 
and is amenable to parallelisation. Utilising the bridging 
approach presented here not only permits earlier predictions, 
but also significantly improves the robustness of destination 
reverting models against variability in the tracked object be¬ 


haviour. The early inference of the destination, future position 
and time of arrival of the tracked object, e.g. pointing finger or 
vessel, can bring notable benefits such as reducing the attention 
required to interact with in-vehicle displays, enhancing the 
ability of maritime surveillance systems and facilitating super¬ 
vised or unsupervised automated warning/assistive functions. 
The inference results from the two applications considered in 
this paper testify to the effectiveness and usefulness of the 
introduced prediction algorithms. Whilst linear Gaussian mo¬ 
tion and observation models are considered here, extending the 
formulation to a more generic settings (nonlinear and/or non- 
Gaussian), could broaden its applicability to other scenarios, 
especially when targets undertake sharp manoeuvres prior to 
reaching their intended endpoint. This would require the use 
of Bayesian filtering techniques such as sequential Monte 
Carlo algorithms which will entail substantial additional 
computational cost. 
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